Robust information extraction from automatically generated speech transcriptions
نویسندگان
چکیده
This paper describes a robust system for information extraction (IE) from spoken language data. The system extends previous hidden Markov model (HMM) work in IE, using a state topology designed for explicit modeling of variablelength phrases and class-based statistical language model smoothing to produce state-of-the-art performance for a wide range of speech error rates. Experiments on broadcast news data show that the system performs well with temporal and source dierences in the data. In addition, strategies for integrating word-level con®dence estimates into the model are introduced, showing improved performance by using a generic error token for incorrectly recognized words in the training data and low con®dence words in the test data. Ó 2000 Elsevier Science B.V. All rights reserved.
منابع مشابه
Mining Broadcast News data: Robust Info Lattices
Fine-grained information extraction performance from spoken corpora is strongly correlated with the Word Error Rate (WER) of the automatic transcriptions processed. Despite the recent advances in Automatic Speech Recognition (ASR) methods, high WER transcriptions are common when dealing with unmatched conditions between the documents to process and those used to train the ASR models. Such misma...
متن کاملManual and Automatic Transcriptions in Dementia Detection from Speech
As the population in developed countries is aging, larger numbers of people are at risk of developing dementia. In the near future there will be a need for timeand cost-efficient screening methods. Speech can be recorded and analyzed in this manner, and as speech and language are affected early on in the course of dementia, automatic speech processing can provide valuable support for such scree...
متن کاملMining broadcast news data: robust information extraction from word lattices
Fine-grained information extraction performance from spoken corpora is strongly correlated with the Word Error Rate (WER) of the automatic transcriptions processed. Despite the recent advances in Automatic Speech Recognition (ASR) methods, high WER transcriptions are common when dealing with unmatched conditions between the documents to process and those used to train the ASR models. Such misma...
متن کاملA comparison of normalization and training approaches for ASR-dependent speaker identification
In this paper we discuss a speaker identification approach, called ASR-dependent speaker identification, that incorporates phonetic knowledge into the models for each speaker. This approach differs from traditional methods for performing textindependent speaker identification, such as global Gaussian mixture modeling, that typically ignore the phonetic content of the speech signal. We introduce...
متن کاملA Comparison of Normalization and Training Approaches for ASR-Dependent Speaker Identification1
In this paper we discuss a speaker identification approach, called ASR-dependent speaker identification, that incorporates phonetic knowledge into the models for each speaker. This approach differs from traditional methods for performing textindependent speaker identification, such as global Gaussian mixture modeling, that typically ignore the phonetic content of the speech signal. We introduce...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Speech Communication
دوره 32 شماره
صفحات -
تاریخ انتشار 2000